Training an articulatory synthesizer with continuous acoustic data

Authors

  • Santitham Prom-on
  • Peter Birkholz
  • Yi Xu
Abstract

This paper reports preliminary results of our effort to address the acoustic-to-articulatory inversion problem. We tested an approach that simulates the acquisition of speech production as a distal learning task. Acoustic signals of natural utterances, represented as mel-frequency cepstral coefficients (MFCCs), serve as the input; VocalTractLab, a 3D articulatory synthesizer controlled by target approximation models, serves as the learner; and stochastic gradient descent is the training method. The approach was tested on a number of natural utterances, and the results were highly encouraging.
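The general shape of such a training loop can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_synthesizer` is a hypothetical two-parameter stand-in for the articulatory synthesizer (it is not VocalTractLab), its three-element output is a stand-in for an MFCC frame, and the gradient is estimated by finite differences because the abstract does not say how gradients are obtained. The sketch only shows the distal-learning idea: the learner adjusts control parameters, but the error is measured in the acoustic feature space.

```python
import math
import random


def toy_synthesizer(params, t):
    """Hypothetical forward model: control parameters -> feature frame at time t.

    A stand-in for an articulatory synthesizer plus feature extraction;
    it is NOT VocalTractLab and does not compute real MFCCs.
    """
    a, b = params
    return [a * math.sin(t) + b, a * b * math.cos(t), a - b * t]


def frame_loss(params, t, target):
    """Squared error between the synthesized frame and the target frame."""
    out = toy_synthesizer(params, t)
    return sum((o - y) ** 2 for o, y in zip(out, target))


def numeric_grad(params, t, target, eps=1e-5):
    """Central finite-difference gradient of the frame loss (an assumption)."""
    grad = []
    for i in range(len(params)):
        up, down = list(params), list(params)
        up[i] += eps
        down[i] -= eps
        diff = frame_loss(up, t, target) - frame_loss(down, t, target)
        grad.append(diff / (2 * eps))
    return grad


def train(frames, init, lr=0.01, epochs=100, seed=0):
    """Stochastic gradient descent: one parameter update per shuffled frame."""
    rng = random.Random(seed)
    params = list(init)
    for _ in range(epochs):
        order = list(frames)
        rng.shuffle(order)
        for t, target in order:
            grad = numeric_grad(params, t, target)
            params = [p - lr * g for p, g in zip(params, grad)]
    return params


def total_loss(params, frames):
    """Sum of frame losses over the whole target utterance."""
    return sum(frame_loss(params, t, y) for t, y in frames)


# Synthetic "utterance": frames generated from known parameters, so the
# learner has a well-defined target to recover.
true_params = [0.8, -0.3]
frames = [(0.1 * k, toy_synthesizer(true_params, 0.1 * k)) for k in range(20)]
init = [0.2, 0.2]
learned = train(frames, init)
```

Comparing `total_loss(init, frames)` with `total_loss(learned, frames)` shows whether training reduced the acoustic error. With a real synthesizer the forward model would be far more expensive and higher-dimensional, so the step size, gradient estimate, and stopping rule used here should not be assumed to carry over.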

Similar resources

The Organization of a Neurocomputational Control Model for Articulatory Speech Synthesis

The organization of a computational control model of articulatory speech synthesis is outlined in this paper. The model is based on general principles of neurophysiology and cognitive psychology. Thus it is based on such neural control circuits, neural maps and mappings as are hypothesized to exist in the human brain, and the model is based on learning or training mechanisms similar to those oc...

Session 2aSC: Linking Perception and Production (Poster Session) 2aSC55. Speech sensorimotor learning through a virtual vocal tract

Studies of speech sensorimotor learning often manipulate auditory feedback by modifying isolated acoustic parameters such as formant frequency or fundamental frequency using near real-time resynthesis of a participant's speech. An alternative approach is to engage a participant in a total remapping of the sensorimotor working space using a virtual vocal tract. To support this approach for study...

Real-time control of a DNN-based articulatory synthesizer for silent speech conversion: a pilot study

This article presents a pilot study on the real-time control of an articulatory synthesizer based on deep neural network (DNN), in the context of silent speech interface. The underlying hypothesis is that a silent speaker could benefit from real-time audio feedback to regulate his/her own production. In this study, we use 3D electromagnetic-articulography (EMA) to capture speech articulation, a...

Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces

Restoring natural speech in paralyzed and aphasic people could be achieved using a Brain-Computer Interface (BCI) controlling a speech synthesizer in real-time. To reach this goal, a prerequisite is to develop a speech synthesizer producing intelligible speech in real-time with a reasonable number of control parameters. We present here an articulatory-based speech synthesizer that can be contro...

Knowledge from Speech Production Used in Speech Technology: Articulatory Synthesis*

There appears to be a continuing trend toward incorporating knowledge of speech production into speech technology: text-to-speech synthesis (e.g., Bickley, Stevens, & Williams, 1994; Parthasarthy & Coker, 1992), low bit rate coding (see Schroeter & Sondhi, 1992), and automatic speech recognition (e.g., Rose, Schroeter, & Sondhi, 1994; Shirai & Kobayashi, 1986). For automatic speech recognition, ...

Publication date: 2013